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How Rings assign address space 

Stepl: aHgnircartittg address to self (to some power of 2) 
Step2. asagithe resxitto self address 

Step3: next_addr = self_addr + self_addr_space; // number of register usedlocally 
Step4: send downnext_addr 



Example: 

Dmaneeds 16adds 

Uartieeds4 self =32 
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-d 



clkl 



clk2 



If the delay between "clkl" and "clk2" 
greater then the delay from Q to d of 
second flipflop, we have a race on our 
meaning right hand flipflop will 
sample the data of Q a whole clock period 
early. 



compound A 



no 



> 



(logic 



compound B 



clock runs with data 

the problem is possible race. 
However, we control the logic on 
each flipflop leaving the compound, 
because it is always the same standart 
ring- interface module, we can ensure, 
that the delay will be at least enough. 
And more importantly easily checked 
after layout. 
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clock opposes data 

this arrangement has the advantage 
of auto ensuring the no race 
condition (at least in this simple 



data_b case ) exists 



V glka 

compound 



compound B 




data_a which changes after clkb, 
which is later then clka, is sampled 
by clka. NO RACE. 



r1° 



*9 
7 



clka 



Qal 



r 



90 
/ 



' T 



7? 



— v- 

OK \ 




72- 



compound B 



compound A 
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data Oft* 



clock 



clock 





clock 



ncertainty range 



data_a leaving the bridge goes to member "b" and 
there should be sampled by rising of clkb. clkb 
lags a lot behind clka of the bridge. As clearly seen 
from the waveforms, race is eminent. Here we 
should add latches for all the data lines (-90). 
Adding latch works however if the delay between 
clka and clkb is less then 75% of cycle time, 
otherwise the uncertainty kills the usable time. It 
sets hard limit on the number of ring members. 
Also keep in mind that latches needed on each OK 
signal between members of the ring 
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Here, data_b leaves member "b" to be sampled by 
elka in the bridge. But now elkb lags a lot behind 
elka. This actually works to our advantage, If the 
lag is smaller then better part of clock cycle. This 
solution looks better, because between adjacent 
members, we can take care to delay the datas 
beyond danger zone of clock delay, the OK signals 
are covered automatically, and last leg data is also 
covered. The only signal not safe is the OK from 
bridge to M b" member. Tt will need a latch in w b*\ 




F<9 



big module 



local_clock ,i0 

local_data_out 



data from 
previous mem her 

elk 




clock 



ring interface 



local clock lags behind 
ring_interface clock of this 
module, because we presume the 
module is big. for data_coming 
out, it is not a problem, it changes 
later then ring-i/f flipflops clock. 
However for data entering the 
module from previous member, 
the race is a possibility we must 
look into. 
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if module "a" sends a message to module "b'\ ring works 
fine. However if most of the traffic is from "c" to "b*\ 
this is more expensive in terms of latency. 



Another problem is "peak latency". Suppose that , "a" 
transmits mostly to "d" and "b" mostly to "c" In this case 
communication between "b" and "c" suffers degradation 
in case that peak traffic coincide. 
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Land bridge gets its name from the fact that it is 
a luxury. Tt spans across connected modules. 
The idea is simple. When V2 sends message to 
Dl it gets to one side of the bridge. This side 
analyzes the destination address and by some 
magic (explained later) decides to short-cut the 
path. The message re-appears at the other end of 
the bridge and gets fast to Dl . By same magic, 
message fromVl to D2 get bypassed also, 
message fromVl to Dl is treated directly. 




Enumeration is started by " Anchor" 
which assigns address= 1 to itself, results 
of enumeration are labels 1 to 7. land 
bridge gets two addresses t as if it were not 
one module, there is "near" end, that got 
enumeration label "3", and the "far*' end 
marked 6. 
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Bridge takes responcibility for strays, 
but only at the "far" end. During 
enumeration, bridge is "polarized" to 
have near and far end. Near is the end 
first struck by enumeration message. 



So we have exactly one enforcer for each 
ring. 



3 near 




11 far 



In land bridge ring, the situation is trickier. If V2 
send message to address==5. The land bridge 
divert at 1 1/far end. it will re-appear at 3 and 
start cycling forever. 

We have to define an algorithm that will take 
care of all cases. 

Luckily there is a way. 

Land Bridge deals only with messages arriving 
at the far end and being diverted. It marks and 
monitors only those. Messages arriving at near 
end, keep their markings. Messages at fdar end 
going through, are left alone. 



F '5 
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< 



type 



2ft addr^ 7 



64 data^ltf 



ok 
scan_test 



reset 



clk(?) 



i— 

1 



elk 



type 



ok 



idle / msgA 



X ms g R 



\idk 



during the first clock, OK remains active, when type 
is of msgA. It means that on the next clock, 
member A may send new message. memberA uses 
this ok to send msgB on the next clock. msgB gets 
stuck for a clock because OK goes inactive. It goes 
inactive because the fifo in memberB is full. One 
clock later, the fifo has a free entry, so OK returns to 
1 and type returns to idle next clock, return to idle 
could also be change to next message, if there was 
one. 



imsg iok 



omsg ook 



p, 't 




umsg 



uok 
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The incoming messages are examined first 
to see if it is supervisor or work/program. 
Work/program messages have address field. 
We check if it is our address. Since we know 
that our address is aligned to our power of 2, 
The address mask (named split mask) 
causes only certain number if upper bits to 
be compared. The lower part of the address 
is passed inside as internal address. The 
upper bits are compared against self-address 

register. This register gets its value during „ 

enumeration protocol. The lower part of this C comparator i 
register is always masked,. Hopefully I 
synthesis will delete the unused bits I 

implementation. ours/through part of the address 




incoming 
address 

address 
split mask 

self address 
register 



that enters the member 
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Member 



RifJL* Rif_* Rif_o_* module.id 



REP Address Space = 7 



Activation register 
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(^bbl) 









the second land bridge solves most traffic problems, but 
adds 4 clocks in the overall ring length. This is not a big 
problem because no message should travel the whole 
perimiter. 



3^ + 



3^ 




The Utopia interface is 
forced into mode that 
communicates in 
messages, not cells. We 
using the I/O and maybe 
some of the logic. 



Application 

Specific^ 
Accelerators 

CRC 
Encryption 
Table Lookup 
Hashing 

35^ 



Internal Memory 



3*2. 



Fast, Unified, Multi-port 



Doorbell! 



Vobla 

Network Processor 



Peripheral Expansion AresF^ 
Enet, ATM, Uart, USB, Serials 



v. 



System 
Expansion 
Area 

CPU (PP) 

DMA 
Smart FIFO 
Ext. mem I/F 
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Internal Memory 



FTU 



$1* 



LSU 



Program Sequertcvr 

PSU 



Vobla Core 



i 



3*4 



PrelfWtl&riump ^ BuiE ^ Register File 

PBU grfg RFU 



DALU 



31 0 



5 



1, 1, 1, 1, 1, 1 



Agent L'F 

AG1 1-3*2. 



Cocc Debug 



Dtxirbclt 



DMA Ageni 



E 



Agent Bus 



1 

37 



I'xecnij] Wwld L'F 



Vobla Compound] 
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32 register of 
32 bits per taik.^ 

A set of indtc&titmh 
per task, vw Inch 
control ta& execution ' 

Aft interface to 
adjacent resources ** 

t-asl memory accessed 
by load/*uwe 



ucneral 
purpose 
registers 



Agent 
interface 



Internal 
incmtfy 



Special 
purpose 
icgisterx 



P configuration 
giaters 



External 
memory 



per task and 
globai registers 



Initialised by 
the PP 



Big area aeccft&cil 
via a DMA interface 
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Rl register: 



Ft, 




s - sticky bit 

e<| - equal/ zero 

It - kis th«vnegativc 

Kt - greater then/ positive 

c - carry 

mb reflection of the RAM mutti-reader busy indication 



t/-3o 




»K»r index. > 2) 
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Frame structure 
of an example 
task type 



r27. 



43 



common 
task data 


task 

fragment 1 
data 






task fragment 
2 data 


task 

fragment 3 
data 


data of all lcvc!2 functions 


level 1 n 
data 


level l (2 
data 


level 1 ft 
data 







size oflcvclO 
frame part is 
different for 
each task 
type 

A size of Icvcl2 
! frame part it 
* constant 

size of level I 
frame part is 
different per 
each task 
type 
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3J> - Flip Flop 



3> -Logic 



APP ID=10064334 



Page 255 of 281 




APP ID=10064334 



Page 256 of 281 




APP ID=10064334 



Page 257 of 281 



IQ13!^!"3 3Mi" * 0702 0E 




typc_ m 

_interface_aikiress{ 1 5 .0 
f »_inlcrfacc_data[3 1 .0) 



type out[7:0](s.ijteuCRC ) 

ad drgss_ out[23:0] 

data out[63:0] 
(alscyoCRC) 
_^ P*l'lk 

ok2dnvc 



memory data 



message data 




agent commancj opcode 
(AID»murtlread 



vobla entry I , | i p ■ i s 
in multiread er 



increment first union bsi 
(op3) 



source addr in srain address of dcstcnalion number of bytes 

to transfer 
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Vobla 



agent interface 



J 



J 



mam 
entry 



shadow 
entry 



output 

message 
encoder 



type out[7:0] 
address out[23 :0] 
*data5uT[63:0] 





elk 




ok2drive 



agent command 



<optlon[6] =0) 



opcode 


options [9:0] 


RA 


A1D[4:0J 


RB/imm8 



me*sage_sender32< o,options[ 5 .0] raw_data[3 1 : 0] 



raw_address[23:U] 



t>Pc[7:0] | 



message data 



address_of_dcstcnalion message type 



agent command opcode 

(AlD=me*aage_s©Htmr) 

(option [6] =1) 



options[9:0] 



AID[4.0] 



mesaage.aendece j^^P 11 ^ 13 ^^! raw - data t 63:0 J | raw_addrcss{23.0] | lOOOOOOo] 



message data 



addrcssof destenation message type 



6% 




type_in[7:0] 



wnt? inicffocc,addrcssl23 :0] 
intcrfacc_daifl[3 1 :0] 
>ase_addrcss[7 :0] 

type out[7:0] 
address_out[23.0] 
Sta^!uT[63:0] 

el 



"A 



I 
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agent command 
(AID=dma agent f 

dma rogues t| M| J uj |**pj 

- tlify urgent dir autosaid lone dram address 
feffs (0 P2) (OPI)set ack addfess uram aaurcss 



fOP9) 



(OPO) <OP3)<OPIO) 



number of bytes 
to transfer or 
address modifyer 



multireade ' 



last in transfer 



l4st_in 



calculate^ re 



rans i 
frag ) 



rr julttrcadjata "31:0] 
2:i] 

tii cader 



lata si /x 



stall mu 



Vobla 



igent inlerfs ze 



stall vobla 
- 



register 




input 


file 




buffer 


♦ 





tx data mux 



on dmand 



CRC data 



I 



random 
number 
generato 
(TBD) 



ukct 

5,10.32 
machine; 



checksum 
machine 



bip16 
machine 



elk 



reset 



Ft $ • 



Ft*. ^ 



agent command 
(AID=CRC agent) 



4°i° 



opcode 


options[9:0] 


RA 

«. . . - ✓ i 


A1D[4:0] || RB j 




CRC type data size generateoperation ove/wnte 
check mode residue 
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Vobla 



igent interfa ce 



control 
register 



counter 




pre scatcF 4- 



div 

by 2 



time stamp 
register 



elk 



reset 



agont command 
(AlO=tlmor agent)' 




3 
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-i agent interface |~ 



Vobla 



setdoorbell^T 

next task_vaH( 



ask 



v current task ic [5:0] 
|v current task id va I id 
|v next_task_id[5 K)] 
v|f>orbell_req[2| Q1 
|v^Qorfaell_mask[2:0] 
1:0] 



input 

message 
decoder 



|^ v asct 



task mask 
registerfilc 



task 



task request 
register file 
and counter* 



TGMR 



mask control 
logic 



priority logic 



rif i wri te 
1 nF T doorbell cs 



irifi addr(5:0] 
fnfi dataf 4;01 
Tm set maskO 



]dm_set mask 1 
1 am - set mask2 



.dm set maskJ 




■ rif i re set 



rir i clo ck 



agent comma 
(AID=doorbelI 



opcode 



options [9:0] 



RA 



AID[4:0] 



RD/immB 



set/clear clear clear 
global request mask 
(OP2) (OP!) (OPO) 



{0,0,0,0.0,ma5,k bit index [2:0] } write mask 
or 

{0,0,0,0, 1 ,req_hit_index[2:0] } write request 
or 

{ l,0,0,0.0,count_value[2:0]J write counter 
or 

{0,1,0,0,0,0,0.01 write TGMR 
or 

{1,1 ,0,0,0,0,0,urgcnt value } write urgent 
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Memory 



Trajan chip 



Memory 
port 



^ " 



cs 



cs 



Daia 



Addr 



F1FQ„RD 



,FIFO_WR 



WR FIFO 



RD FIFO 



FPGA 



Encryption 



DSP 



PCI 



Host* I <PP) 
Hon 02 (DMA) 



Ring 
interface 



7 



Message sender 



Write request 
generator 



Read request 
generator 



WR FIFO used 
words counter 



n 



RD FIFO used 

words C 



Memory controller 



CS 
ADDR 



DATA 



FrFO_RD_ 



HFO_WR 



Host #3 (Ring extender) 



Trajan 



Ft 
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Trajan memory 
interface 



Trajan FIFO_RD ^ FIFO_R 5 
input 



Trajan FIFO.WR FIFO_WR 



DPR 



Host address 
generator 

RAM / FIFO 



Target address 
generator 

RAM / FIFO 



Synchronizer 



Synchronizer 



Synchronizer 



Interrupt 



FPGA 



Interface to 
external 
device 
(device 
specific) 



\ 
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setup registers: 



r ing in . 



doorbell, taskid and viscode 
urgent viscode 
header len and address 
threshold to urgent 




ring c 



free 



ovrn 



last 



size 



rx status word 
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setup registers: 



mac 



doorbell, taskid and viscode 
urgent viscode 
send ahead address 
threshold to transmit 
threshold to urgent 



ringjn 




doorbell 
free entry count 
inished frames coui 



ring c 
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_Propran 



irr_^_ 



is 



fPBit?fei3 rife 



S 



are I 

src2 



~3est 



Agent 1/F 

AOL 



4* 



^5 ^ 



Vobla Core 

.. . Address 

Agent Bus 



Jigf 



«p £°tjv > 



External World 




Instruction 1 


Instruction 


Address j 


Read source i 


Write result 


fetch request j 


decode ■ 


calculation, j 


Registers j 


into destination 




Data access req.J 


Data execution j 


register 




13 



- Flip Flop 



- l>ogic 
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xOSlUn»Canl 
ATM » ATM 




k f IP over ( ( Eth over "t3 {ErFoveT" 

| RFC 1483 U | RFC 1483 g 1 RFC 1463 

ImmJ gg J 

AAL-5 |* AAL-5 | 



ATM Device driver 



Utopia Interface 



ATM Device driver 




si 
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BSP 
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E1RIKS 



I AAU>, AAL/.AAU.AALO 
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Protocol specific 

data path 
functional blocks 




Ganeric data path 

processing 
functional blocks 



.1 EwMHosm 



Generic system 

service 
functional blocks 



Fault nod •xcopOon roport 



Vobla maintenance 
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